An Information Theoretic Approach to Rescoring Peptides Produced by De Novo Peptide Sequencing

نویسندگان

  • John R. Rose
  • James P. Cleveland
  • Alvin Fox
چکیده

Tandem mass spectrometry (MS/MS) is the engine driving high-throughput protein identification. Protein mixtures possibly representing thousands of proteins from multiple species are treated with proteolytic enzymes, cutting the proteins into smaller peptides that are then analyzed generating MS/MS spectra. The task of determining the identity of the peptide from its spectrum is currently the weak point in the process. Current approaches to de novo sequencing are able to compute candidate peptides efficiently. The problem lies in the limitations of current scoring functions. In this paper we introduce the concept of proteome signature. By examining proteins and compiling proteome signatures (amino acid usage) it is possible to characterize likely combinations of amino acids and better distinguish between candidate peptides. Our results strongly support the hypothesis that a scoring function that considers amino acid usage patterns is better able to distinguish between candidate peptides. This in turn leads to higher accuracy in peptide prediction. Keywords—Tandem mass spectrometry, proteomics, scoring, peptide, de novo, mutual information

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved De Novo Peptide Sequencing using LC Retention Time Information

Liquid chromatography combined with tandem mass spectrometry (LC-MS/MS) is an important tool in proteomics for identifying the peptides in a sample. Liquid chromatography temporally separates the peptides and tandem mass spectrometry analyzes the peptides, that elute one after another, by measuring their mass-to-charge ratios and the mass-to-charge ratios of their prefix and suffix fragments. D...

متن کامل

Multi-spectra peptide sequencing and its applications to multistage mass spectrometry

Despite a recent surge of interest in database-independent peptide identifications, accurate de novo peptide sequencing remains an elusive goal. While the recently introduced spectral network approach resulted in accurate peptide sequencing in low-complexity samples, its success depends on the chance of presence of spectra from overlapping peptides. On the other hand, while multistage mass spec...

متن کامل

De novo sequencing of peptides by MS/MS.

The current status of de novo sequencing of peptides by MS/MS is reviewed with focus on collision cell MS/MS spectra. The relation between peptide structure and observed fragment ion series is discussed and the exhaustive extraction of sequence information from CID spectra of protonated peptide ions is described. The partial redundancy of the extracted sequence information and a high mass accur...

متن کامل

De novo sequencing, peptide composition analysis, and composition-based sequencing: a new strategy employing accurate mass determination by fourier transform ion cyclotron resonance mass spectrometry.

A new strategy is described for the determination of amino acid sequences of unknown peptides. Different from the well-known but often inefficient de novo sequencing approach, the new method is based on a two-step process. In the first step the amino acid composition of an unknown peptide is determined on the basis of accurate mass values of the peptide precursor ion and a small number of accur...

متن کامل

Open - pNovo : De Novo Peptide Sequencing with Thousands of 2 Protein

9 ABSTRACT: De novo peptide sequencing has improved remarkably, 10 but sequencing full-length peptides with unexpected modifications is 11 still a challenging problem. Here we present an open de novo 12 sequencing tool, Open-pNovo, for de novo sequencing of peptides 13 with arbitrary types of modifications. Although the search space 14 increases by ∼300 times, Open-pNovo is close to or even ∼10...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012